(19) 



J 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 



(12) 



( ii) EP 0 856 803 A2 

EUROPEAN PATENT APPLICATION 



(4o) Date of publication. 


(51) IntCI. 6 : G06F 17/30 


ub.uo.iyyo Bulletin lyyo/o^ 


(21) Application number: 98300577.8 




(22) Date of filing: 27.01.1998 




(84) Designated Contracting States: 


• Kandasamy, Ramiah 


AT BE CH DE DK ES Fl FR GB GR IE IT LI LU MC 


Cupertino, California 95014 (US) 


NL PT SE 


• Skier, John A. 


Designated Extension States: 


San Jose, California 95123 (US) 


AL LT LV MK RO SI 






(74) Representative: Lunt, Mark George Francis 


(30) Priority: 31.01.1997 US 792139 


Dibb Lupton Alsop 




Fountain Precinct 


(71) Applicant: Informix Software, Inc. 


Balm Green 


Menlo Park, California 94025 (US) 


Sheffield S1 1RZ (GB) 


(72) Inventors: 




• Balabine, Igor V. 




Cupertino, California 95014 (US) 





(54) File system interface to a database 

(57) Information in a database is accessed with a 
computer system by transforming a file system request 
from an application into a database query and retrieving 



information corresponding to the database query from 
the database. The retrieved information is made avail- 
able to the application as a file system object, for exam- 
ple, as a directory, a file, a link or a collection thereof. 
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Description 

Background 

This invention relates to accessing information in a 
database. 

A database is a body of information that is logically 
organized so that it can be retrieved, stored and 
searched in a coherent manner by a "database engine" 
-- a collection of software methods for retrieving or ma- 
nipulating data in the database. Databases generally fall 
into three categories: relational databases, object-ori- 
ented databases and object-relational databases. 

A relational database (RDB) is a collection of fixed- 
field two-dimensional tables that can be related (or 
"joined") to each other in virtually any manner a data- 
base developer chooses. The structure of a relational 
database can be modified by selectively redefining the 
relationships between the tables. A database engine 
may perform complex searches on a relational database 
quickly and easily by using any of various database que- 
ry protocols such as the method expressed by the Struc- 
tured Query Language (SQL) or by other mechanisms. 
The relationships between the tables enable results of 
a search to be automatically cross-referenced to corre- 
sponding information in other tables in the database. As 
shown in Fig. 1 , for example, a relational database 1 00 
includes a customer table 1 02 which is joined by a log- 
ical link 1 03 to an order table 1 04 which in turn is joined 
by a logical link 105 to an inventory table 106. A user 
may query the database 100, for example, for all order 
numbers higher than a threshold value. Because the or- 
der table 1 04 is joined with the customer table 1 02 and 
the inventory table 1 06, the list of order numbers result- 
ing from the query can be retrieved and displayed along 
with the respective customer names and inventory items 
that correspond to the identified order numbers. 

An object-oriented database (OODB) is a collection 
of "objects" ~ software elements that contain both data 
and rules for manipulating that data. In contrast to a re- 
lational database which can store only character-type 
data : an OODB can store data of virtually any type (text, 
3D graphic images, video clips, etc.). An OODB stores 
its constituent objects in a hierarchy of classes with as- 
sociated rules so that the OODB contains much of the 
logic it needs to do useful work. A relational database 
in contrast contains only data and must rely on external 
application software to perform useful functions with the 
data. 

A object-relational database (ORDB) is a hybrid of 
the other two types. Non-character data (e.g., an image 
file) may be stored and retrieved in an ORDB as a binary 
large object (BLOB) - an undifferentiated mass of data. 
Rules for manipulating the data contained within a 
BLOB (e.g., a utility for viewing image files) may be 
stored either within the database or external to it de- 
pending on the particular ORDB implementation. The 
Informix® Universal Server (I US®) is an example of an 



object-relational database management system (ORD- 
BMS) that internally stores the rules for manipulating 
BLOBs so that they may be treated as "native" data 
types - that is, data types that the ORDBMS itself has 
s the capability to manipulate. 

Information within a relational or an object-relational 
database typically is accessed by SQL-compliant com- 
puter programs that are written to accomplish a specific 
function. For example, a user may write a SQL program 
10 that retrieves a list of customer names from a database 
which stores customer information. Alternatively, many 
different application programs are available that support 
database queries and which allow a user to interactively 
formulate a database query by specifying an arbitrary 
15 set of criteria (e.g., the names of all out-of-state custom- 
ers with overdue accounts). This type of application pro- 
gram presents the user's database query to the data- 
base engine which retrieves the requested information 
from the database. Such application programs are re- 
20 ferred to as "database aware" because they are have 
the ability to interact with and manipulate databases. 

Most application programs, in contrast, are "data- 
base-unaware" meaning that they cannot access infor- 
mation stored in a database. Rather, database -unaware 
25 applications rely on file systems, such as the Network 
File System (NFS) developed by Sun Microsystems, 
Inc., for storing and retrieving information in discrete 
files. A database-unaware program stores each sepa- 
rate document in a separate disk file identified by the 
30 user of the application. In Fig. 2, for example, a file sys- 
tem 200 has two disk drives mounted: drive 202 which 
is mapped to the label a and drive 204 which is mapped 
to the label b: Each of the a and b drives includes one 
or more directories (docs on the a. drive 202; dirl and 
35 dir2 on the b: drive 204) which in turn may have subdi- 
rectories (subdirl in dirtl, subdir2 and subdir3 in 
dir2) and so on with virtually any level of hierarchical 
nesting being possible. Files 206-212 may exist at any 
of the various directory or subdirectory levels within the 
40 file system. The labels a. and b: represent the "name- 
space" of the file system. That is, all filename paths that 
begin with a: or b: are within the file system's name- 
space. As shown in Fig. 2, for example, a document that 
lists names of out-of-state customers is stored in the file 
45 system's namespace at a location defined by the filena- 
me path a -.\docs\cust_outstate.txt which means that 
a file 211 named "cust_outstate" of the type "txt" is 
stored in a directory named docs on a disk drive 202 
mapped to the label a.. Another document that lists 
50 names of customers with overdue accounts is stored in 
a separate disk file located at the filename path a: 
\docs\cust_overdue.txt. These two files are separate 
and distinct entities that are not related or joined in the 
sense that tables in a database are related. 

55 

Summary 

In one aspect of the invention, information in a da- 
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tabase is accessed with a computer system by making 
one or more database objects (e.g., a table or a row) 
available as one or more file system objects (e.g., direc- 
tories, files or links) to an application, for example : a da- 
tabase-unaware application. The database may be re- 
lational, object-relational or object-oriented. If multiple 
file system objects are made available, collectively they 
may represent a hierarchical file system. A file system 
request issued by the application that corresponds to 
the file system object is transformed into a database op- 
eration, for example, an SQL query, which is performed 
on the database with a database engine. 

Information associated with the database object 
which is retrieved as a result of the database operation 
may be formatted into one or more file system objects 
and returned to the application. The particularformatting 
of the retrieved information may be defined in an exten- 
sion module, which also may include information that 
defines the specific manner in which the file system re- 
quest should be transformed into a database query. The 
database operations, including formatting of a database 
query, retrieving information and formatting it into file 
system objects ; are performed transparently to the ap- 
plication. 

Upon receiving the file system objects, the applica- 
tion may display them on a display screen of a computer, 
for example, as graphical representations of file system 
objects. The database object that is made available may 
be presented as multiple file system objects in formats 
understandable by different applications. Conversely a 
single file system object may correspond to multiple da- 
tabase objects. 

In another aspect, a computer-based data reposi- 
tory management system includes a database of infor- 
mation, a file system-based application program for ma- 
nipulating data, and a file system interface to the data- 
base which provides the file system-based application, 
which otherwise may be database-unaware, with ac- 
cess to information in the database. The data repository 
management system may further include a database 
management system which manages information in the 
database either in addition to, or instead of, the file sys- 
tem-based application. 

The data repository management system may in- 
clude a module for differentiating file system requests 
directed to the file system from file system requests di- 
rected to the file system interface. The file system inter- 
face may include one or more extension modules con- 
taining one or more file objects, each file object including 
information for converting database objects into file sys- 
tem objects. 

In another aspect, information in a database is ac- 
cessed with a computer system by encoding a file han- 
dle with information that specifies a database object in 
a database. In response to a file system request issued 
by an application, the encoded file handle is transmitted 
and then decoded to identify the database object asso- 
ciated with the file system request. The encoding may 



be based on the NFS protocol. The encoded information 
may include information that corresponds to the issued 
file system request and which identifies an extension 
module, a database table and row, metadata, a pointer 

5 to a database object, or a combination thereof. 

Advantages of the file system interface described 
here may include one or more of the following. Applica- 
tions that rely on a file system as a data repository, or 
which are otherwise database-unaware (i.e., unable to 

10 access data in a database), are enabled to access in- 
formation in a database in a transparent manner. These 
database-unaware applications can share data seam- 
lessly both with database -aware applications and with 
other database-unaware applications. Under IXFS, a 

15 database may appear to an application as just another 
local or remote file system that is no different in form or 
character from the other file systems available to the ap- 
plication. No change to the application's program code, 
the database or the database engine is required. As a 

20 result, users of database-unaware applications are pro- 
vided with database functionality without having to in- 
vest the time and cost typically associated with data- 
base-aware tools. 

A system administrator may use the IXFS system 

25 to combine disparate data storage technologies (e.g., 
fileU based systems with database systems) in creating 
a unified data repository strategy that spans an enter- 
prise. The enterprise's investment in legacy data repos- 
itories is maintained because data present in the repos- 

30 itories may easily be transferred to a database as the 
enterprise moves to the relational or object-relational 
model of data storage. Moreover, the enterprise's in- 
vestment in database-unaware applications is en- 
hanced because IXFS enables them to be used to man- 

35 age data stored in a database. 

The ability for a database-unaware application to 
access information in a database combines the simplic- 
ity of the file system paradigm with the sophistication 
and effectiveness of database manipulation techniques. 

40 This capability is particularly useful for Internet World 
Wide Web applications in which a user seeks to access 
a large store of data using, for example, the hypertext 
transfer protocol (HTTP). In contrast to a common gate- 
way interface (CGI) script, which spawns an external ap- 

45 plication to retrieve data from a database in response to 
a URL (Uniform Resource Locator) encoded request, 
the IXFS system converts such a request into a form 
that may be executed by a database engine directly 
quickly and transparently. 

50 The ability to represent an arbitrary collection of ta- 
bles in a database as various file system objects pro- 
vides a software developer with a rich and flexible set 
of tools. The extensible nature of IXFS allows it to be 
tailored to virtually any type of application so that the 

55 database will appear as a collection of file system ob- 
jects that are consistent with the application's other file 
system objects. 

Other advantages and features will become appar- 
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ent from the following description, including the draw- 
ings and claims. 

Brief Description of the Drawings 

Fig. 1 is a diagram of a relational database. 

Fig. 2 is a diagram of a file system. 

Fig. 3 is a diagram of a system for accessing data 
in file system and in a database. 

Fig. 4 is a flowchart of accessing data in a database 
using the system of Fig. 3. 

Figs. 5A, 5B and 5C are example screen displays 
from an application accessing information in a file sys- 
tem and in a database. 

Fig. 6 is a data structure diagram for a file object. 

Fig. 7 is a diagram of a kernel level file system ar- 
chitecture. 

Fig. 8 is a diagram of a network file system archi- 
tecture. 

Fig. 9 is a data diagram of a NFS file handle as used 
in the network file system architecture of Fig. 8. 

Detailed Description 

The use of a database to store persistent data pro- 
vides several advantages that are not available when a 
file system is used as a data repository. The structure 
of a database, and the internal relationships between 
tables within the database, enable fast and arbitrarily 
complex queries for information to be performed on the 
database. A file system in contrast has no standard data 
query mechanism for searching for specific data items 
within the files managed by the file system. Other fea- 
tures provided by database systems for which conven- 
tional file systems have no analog include well-defined 
management policies, auditing capabilities, transparent 
data replication, logging facilities, and consistent back- 
up and restore procedures. 

Using a database system as a data repository re- 
quires relatively complex and expensive tools, such as 
special purpose database-aware applications, and of- 
ten an increased level of sophistication and training by 
the end-user. File systems in contrast generally are sim- 
ple to use, cheap and pervasive. Virtually every compu- 
ter operating system provides a native file system that 
may be used by applications for storing persistent data. 
Th is among other reasons is why approximately 85-90% 
of all persistent data is stored in file systems by data- 
base-unaware applications. 

The file system interface described here, dubbed 
the Informix® File System (IXFS) interface, provides 
computer system users with the best of both worlds by 
enabling database-unaware applications to access (i.e., 
read and write) information in a database in a manner 
that is entirely transparent to the application. No chang- 
es need to be made to the application or to the database. 
IXFS presents the contents of a database to the appli- 
cation as "file system objects" such as directories, sub- 



directories, files or links. These file system objects ap- 
pear to the application to be no different in form or char- 
acter from the file system objects that the application 
handles in the ordinary course of storing and retrieving 

5 data. IXFS enables a user of a database-unaware ap- 
plication to access the contents of a database by per- 
forming the desired operations on the file system objects 
that represent the database's contents. 

As shown in Fig. 3, IXFS 300 sits between a data- 

10 base-unaware application 302 and a database 304 and 
monitors all requests issued to the file system 306 by 
the application 302. When the application seeks to ac- 
cess information in the database, a component of the 
IXFS system translates the file system request into a 

15 database query format that is understandable by the da- 
tabase. Similarly, information received from the data- 
base (in response to a file system read request by the 
application, for example) is represented to the applica- 
tion as one or more file system objects. 

20 A high-level description of the operation of IXFS and 
its interaction with the computer's operating system is 
provided with reference to the flowchart of Fig. 4. When 
a file system request (e.g. a data read or write) is issued 
by an application program, the operating system deter- 

25 mines whether it corresponds to information contained 
in a file namespace managed by IXFS (step 400). The 
operating system is able to differentiate requests for da- 
ta stored in other file systems from requests for data in 
IXFS's file namespace because the database has been 

30 mapped to a namespace (e.g., x:) that is mutually ex- 
clusive with the file system's namespace (e.g., a. and 
b:). In effect, the database appears to the operating sys- 
tem and to the application as a disk drive mapped to the 
label x:. 

35 |f the file system request is not directed towards in- 
formation managed by IXFS, the request is handled by 
other file systems (step 401). If, on the other hand, the 
file system request corresponds to information in IXFS's 
file namespace, the operating system passes the re- 

40 quest onto IXFS which in turn furnishes the request to 
an extensible component of IXFS, i.e., an "extension 
module," for translation into a form understandable by 
the database - an SQL query, for example -- (step 402). 
After the request has translated into a database query, 

45 the IXFS extension module presents the query to the 
database engine which uses it to access the database 
either by modifying the desired information (for a write 
request) or by retrieving the desired information (for a 
read request) and returning it to IXFS (step 404). If in- 

50 formation has been retrieved from the database (step 
406), the IXFS extension module formats it according to 
predefined criteria into file system objects (step 408) 
which are presented to the application (step 41 0). Upon 
receiving the file system objects from IXFS, the applica- 

55 tion treats them as if they came from a file system. In 
fact, the application is unaware that the file system ob- 
jects came from a source other than a file system. In this 
manner, all requests for data in the file system's name- 
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space are handled by the file system while all requests 
for data in the file namespace assigned to the database 
are handled by IXFS. 

An example of how IXFS may be used to represent 
a database as a file system to an application is provided 
with reference to Figs. 5A-5C. Assume that a user of a 
window-based computer system uses a file system nav- 
igation tool to examine the information that is stored both 
in the file system represented by Fig. 2 and in the data- 
base represented by Fig. 1 . Assume further that the file 
system's namespace is represented by the labels a: and 
b: and that IXFS is mapped to drive x: on the client ma- 
chine. As shown in Fig. 5A, the navigation tool window 
500 initially displays the file system's two drives, a: and 
b:, and the drive x: corresponding to the database, in a 
collapsed state. At this point the user instructs the nav- 
igation tool to expand drive b: r thereby making its hier- 
archy of directories and subdirectories visible to the us- 
er, and opens subdirl which contains two files, 
doc206.txt and doc207.txt, as shown in Fig. 5B. The 
file information displayed in Fig. 5B is retrieved from the 
a: and b: drives using standard file system operations. 

Next, the user instructs the navigation tool to ex- 
pand drive x:, which is mapped to the database via 
IXFS, so that the contents of drive x: may be examined. 
Because the corresponding file system request issued 
by the navigation tool points to drive x. - the file name- 
space assigned to the database -- IXFS handles the file 
system request by passing it to an extension module 
which formulates a database query to retrieve the re- 
quested information from the database. After the infor- 
mation has been retrieved, it is formatted into file system 
objects with a method invoked by IXFS and returned to 
the navigation tool. The information retrieved from the 
database appears to the navigation tool, and to the user 
of the navigation tool to be no different in characterf rom 
other file system objects that were retrieved with the file 
system. As shown in Fig. 5C, tables 102, 104 and 106 
in the database 100 of Fig. 1 are represented as three 
corresponding directories —customer, order and in- 
ventory. Similarly, three rows within the customer table 
102 - customer_name, customer_addr and 
customeMd - are represented as three corresponding 
subdirectories within the customer directory - name, 
address and id Entries in the name subdirectory are 
represented as text files that are named for their respec- 
tive contents - Adams, Andrews ; Brewster, etc. 

A user may open any of the text files in the x:\cus- 
tomer\name directory (for example, with a standard text 
editor application) modify its contents, and perform a 
standard "file save" operation. In response, IXFS han- 
dles the file save request because it is directed to the 
file namespace assigned to the database and formu- 
lates a corresponding database operation to modify the 
contents of the database as appropriate. 

IXFS allows all file system operations to be per- 
formed on the database. For example, a user could em- 
ploy appropriate features of the navigation tool to 



change the name of the x:\customer directory to some- 
thing else such as x:\cust. Similarly, a user could create 
a new file system object such as a subdirectory or a new 
file underneath the x:\customer directory. Moreover, 
5 access to specified portions of the database could be 
limited for certain users in the same manner that file sys- 
tem objects in a file system may be limited (read only, 
hidden, etc.). 

The specific types, formats and arrangement of file 
system objects that IXFS will return in response to a file 
system request are defined in a corresponding exten- 
sion module - a software component of IXFS that may 
be tailored as desired to encapsulate an arbitrary col- 
lection of database objects (e.g., tables) and represent 
them as a collection of file system objects. In one imple- 
mentation, IXFS includes a Basic Extension Module 
(BEM) that provides a one-to-one mapping of a file in a 
file system into a collection of database objects. Among 
other uses, the BEM allows users to quickly and trans- 
parently move their data from a file system into the I US® 
database management system and run database que- 
ries against it. 

The BEM emulates a file system by encapsulating 
a collection of database tables as specified by a soft- 
ware developer implementing the IXFS system, and 
presenting them to an application as file system objects. 
Each table specified by the BEM corresponds to a di- 
rectory and each row in the table corresponds to a file 
system object (e.g., subdirectory, file or link) present 
within the directory. 

For each database table that it encapsulates, the 
BEM includes a corresponding "file object" 600 having 
a data structure as shown in Fig. 6. The file system ob- 
ject 600 corresponds to, and provides an intuitive rep- 
resentation of, a directory, a file or a link in a file system. 
Each file object 600 includes the file object's name 601 
(an identifier of a file system entity that is unique within 
a given directory), type 602 (directory, file or link), own- 
ership 603 (an identifier of the file object's owner), ac- 
cess rights 604 (access rights to the object for its owner, 
community and others), temporal characteristics 605 
(timestamp of last read, write and look-up operations), 
popularity 606 (number of links pointing to the object) 
and size 607 (object's size in bytes). The file object 600 
also contains its corresponding data object 608 or a 
pointer to the data object. 

Portions of a database are mapped to a file system 
representation by selecting database tables and rows 
as desired, and by designating the type of file system 
object to which each selected table and row corre- 
sponds. For example, the database of Fig. 1 was 
mapped into the file system hierarchy shown in Fig. 5C 
by specifying that each of the customer, order and in- 
ventory tables occupy a separate file object in the BEM 
of the type "directory." Within the file object for the cus- 
tomer table, each of the name, address and id rows have 
been designated as the type "directory, "thereby making 
them appear as subdirectories to the hierarchically dom- 
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inant customer directory. Within the "name" row in the 
"customer" table, the individual customer name entries 
have been designated in the file object as the type "file" 
making them appear as individual text files as shown in 
Fig. 5C. 

Several different IXFS extension modules may be 
resident and operative at the same time to provide ac- 
cess to two or more different databases simultaneously 
or to access different information within the same data- 
base or to provide a different interpretation of the same 
database object. A single extension module is capable 
of presenting the same information in multiple different 
formats, for example, as different types of file system 
objects. In Fig. 5C, for example, the table of customers, 
including their names, addresses and IDs, could be pre- 
sented as a single file system object -e.g., a Microsoft 
Excel file named "customer.xls" containing all of the 
customers' identifying information -- which could be 
opened by an appropriate spreadsheet program that un- 
derstands the "xls" format. The extension module could 
be configured so that the customer.xls file object is pre- 
sented to the application either instead of, or in addition 
to, the x;lct/s tomer directory its component subdirec- 
tories (name, address, /c/j and the files contained there- 
in (Adams.txt, Andrews.txt, Brewster.txt, etc.). 

As another example, an extension module could be 
configured to present the text files in x:\customer\name 
in several different formats for use by alternative appli- 
cation programs. In the database of Fig. 1 , for example, 
multiple different file formats could be provided for each 
customer name by presenting multiple file system ob- 
jects for a single database object. The database table 
entry forthe customer Adams, could be mapped, for ex- 
ample, to three separate file system objects having dif- 
ferent formats: "Adams.doc" for use with Microsoft 
Word, "Adams. wpd" for use with Corel Wordperfect, and 
"Adams.fm" for use with Adobe Framemaker. A user 
who edited the information in the "Adams.doc" object 
would observe that the changes automatically were re- 
flected in the "Adams. wpd" and "Adams.fm" objects. Be- 
cause all three of the file system objects are mapped to 
the same database object (namely the database entry 
for customer Adams), the three alternative file system 
objects may be used interchangeably to view or edit the 
information for customer Adams without concern that di- 
vergent versions of Adams' information will result. 

By employing the appropriate extension modules, 
whether obtained from a software library or generated 
according to custom specifications, software develop- 
ers may enable database-unaware applications (e.g., 
Microsoft Word, Microsoft Excel, Lotus 1 -2-3) to retrieve 
information stored in a database or to store new or mod- 
ified information into the database. At the same time, 
database-aware applications may continue to access all 
of the information stored within the database, including 
information that was stored by database-unaware appli- 
cations in the first instance. Together these capabilities 
enable a single enterprise-wide data repository to be 



maintained with various different applications, both da- 
tabase-aware and database-unaware, being able to ac- 
cess the information in the data repository. Moreover, 
IXFS facilitates the migration of data between different 

5 applications - for example, between a database-aware 
application and a database-unaware application or be- 
tween two disparate database-unaware applications. 

The IXFS system may be implemented by three dif- 
ferent architectures: an object library architecture; a ker- 

10 nel level mountable file system architecture; or a net- 
work file system architecture. 

In the first approach, the object library architecture, 
the ability to access information in a database is 
achieved through a set of software objects made avail- 
's able to database-unaware applications through a library 
- for example, a dynamic linked library (DLL) on a Mi- 
crosoft Windows®-based platform. Using a consistent 
set of file system access methods that operate on the 
database, these software objects provide a functionality 

20 analogous to that provided by the common file access 
Application Program Interfaces (APIs) defined by the 
ANSI C or POSIX standards. Use of the object library 
architecture would require, however, any application to 
be used with the IXFS system first to be relinked with a 

25 new library of IXFS-related objects. The other two archi- 
tectures, in contrast, allow existing applications to ac- 
cess database information without any changes to or re- 
linking of the applications. 

The kernel level mountable file system architecture, 

30 illustrated in Fig. 7, intercepts file system requests at the 
operating system level and passes them on to the IXFS 
system for processing. In the kernel architecture, the 
kernel address space 700 is modified to include an IXFS 
kernel module 701 which is specific to the operating sys- 

35 tern being used (e.g., UNIX, Windows® NT). File system 
requests from application programs are received by the 
OS kernel 702 either from a local client 705 (an appli- 
cation residing in the address space of the local host 
704) or from a remote client 706 (an application residing 

40 in the address space of a remote system) via NFS 703. 
File system requests directed towards the name- 
space of the mounted file system devices (e.g., disk 
drives) are handled by NFS 703 in the conventional 
manner. File system requests that are directed to the 

45 namespace occupied by the database (as determined 
by the IXFS kernel module 701) are passed onto the 
IXFS daemon 708 (or the IXFS "service" in the case of 
Windows NT) for processing. The IXFS daemon 708 
performs several functions including managing connec- 

50 tions to the database 709 (or databases) to be accessed 
and maintaining a list of the filenames being used to ac- 
cess database objects. Upon receiving a request from 
the IXFS kernel module 701 to access the database 
709, the IXFS daemon initializes a filename look-up pro- 

55 cedure to identify the appropriate IXFS extension mod- 
ule 707 to handle the request. The filename specified 
by the request is used as an index into a look-up table 
of corresponding IXFS extension modules by compar- 
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ing the specified filename to a list of names of file objects 
contained in each extension module. After determining 
which IXFS extension module includes a file object 
whose name matches the specified filename, the IXFS 
daemon 708 transfers the request to that extension 
module 707. The extension module translates the re- 
quest into a database operation which is performed on 
the database 709. Any information generated in re- 
sponse to the database query operation is formatted by 
a method invoked by the IXFS extension module 707 
into file system objects according to the file object types 
defined in the extension module. The formatted file sys- 
tem objects are then presented to the requesting appli- 
cation. 

The network architecture, illustrated in Fig. 8, inter- 
cepts file system requests at the network level and pass- 
es them on to the IXFS system for processing. In the 
network architecture, the OS kernel address space 
need not be modified. Rather, all file system commands 
generated by a local client 805 or a remote client 806 
are passed, via a loop NFS connection 810 or a network 
NFS connection 803, to a NFS front-end daemon (or 
service) 804, which resides outside of the OS kernel. 
The NFS front-end daemon 804 is implemented as a 
component of the IXFS daemon. 

Upon receiving a file system request the NFS front- 
end daemon 804 passes it on to the IXFS daemon 806 
and subsequently to the appropriate IXFS extension 
module 807, which provide the same functionality as the 
IXFS daemon and IXFS extension modules in the kernel 
level architecture described above. 

The IXFS system can be adapted to provide an in- 
terface to any type of database, including relational ob- 
ject-relational and object-oriented databases, and to un- 
derstand any other type of database query protocol in 
addition to SQL. NFS was chosen as the network pro- 
tocol to be used in the IXFS implementation of Fig. 8 
because NFS is a widely used standard for sharing files 
across different platforms. However, any other network 
protocol that provides access to file systems over a net- 
work -for example, Microsoft's Common Internet File 
System (CIFS) - could be used in implementing the 
IXFS network architecture. 

The NFS protocol (version 2) allows clients and 
servers to exchange file information by using a "file han- 
dle" - a 32-byte value ~ to identify a desired file. When 
NFS is used as the network file system protocol in the 
network architecture, IXFS can make additional use of 
the NFS file handle to achieve greater speed and effi- 
ciency in performing file system requests on the data- 
base. To do so, IXFS encodes the NFS file handle with 
information that is specific to its operations as shown in 
Fig. 9. 

Bytes 1-8 of the file handle hold the IXFS "magic 
string" - an entity that allows IXFS to distinguish IXFS 
file handles (i.e., file system requests directed to the file 
namespace assigned to the database) from NFS file 
handles (i.e., file system requests directed to the file 



namespace assigned to the file system). The magic 
string is an eight byte string in which each byte is as- 
signed the value FF hexadecimal to identify afile handle 
as an IXFS file handle. Bytes 9 and 10 identify the par- 

5 ticular IXFS extension module whose job it is to manage 
the file handle under consideration. Encoding an IXFS 
file handle in this manner obviates the need to maintain 
an empty ("shadow") directory tree in the file system, 
which otherwise would be needed to generate standard 

10 NFS file handles that correspond to information man- 
aged by the IXFS system. Similarly, this encoding 
scheme makes it unnecessary to maintain a distinct 
mapping entity (e.g., look-up table) between NFS file 
handles and IXFS file handles. 

15 The remaining bytes of the NFS file handle in Fig. 
9 include information that is specific to the extension 
module identified by the extension module identified by 
bytes 9 and 10 of the file handle. Bytes 11 -14 and 15-18 
respectively identify the database table and row that cor- 

20 respond to the file handle. Bytes 19-22 identify the i- 
node (information node) table and row which corre- 
spond to metadata descriptive of file attributes, and a 
pointer to the data in the database, for the file handle 
under consideration. Bytes 19-32 presently are unused 

25 but they are available for use by any newly developed 
extension modules. As a result of the encoding repre- 
sented in Fig. 9, the efficiency with which elements may 
be located in the database is enhanced and the com- 
plexity of designing new IXFS extension modules is re- 

30 duced. 

The kernel level and network architectures provide 
different advantages relative to one another. The kernel 
level approach is the more efficient of the two in that it 
provides a shorter data path between the issuance of 

35 the file system request to the return of file system objects 
from IXFS. On the other hand, the network-based archi- 
tecture significantly minimizes the effort required to port 
the IXFS between different platforms because imple- 
menting the network architecture does not require a 

40 modification to the operating system kernel. In both cas- 
es, however, the IXFS system provides a database-un- 
aware application with transparent access to data in a 
database, while maintaining the inherent advantages of 
using a database for persistent data storage. 

45 The methods and mechanisms described here are 
not limited to any particular hardware or software con- 
figuration, but rather they may find applicability in any 
computing or processing environment in which data- 
base manipulation may be performed. The tecnniques 

50 described here may be implemented in hardware or 
software, or a combination of the two. preferably, the 
technique are implemented in computer programs exe- 
cuting on programmable computers that each include a 
processor a storage medium readable by the processor 

55 (including volatile and non-volatile memory and/or stor- 
age elements), and suitable input and output devices. 
Program code is applied to data entered using the input 
device to perform the functions described and to gener- 
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ate output information. The out put information is applied 
to one or more output devices. 

Each program is preferably implemented in a high 
level procedural or object oriented programming lan- 
guage to communicate with a computer system. How- 
ever, the programmes can be implemented in assembly 
or machine language, if desired. In any case, the lan- 
guage may be a compiled or interpreted language. 

Each such computer program is preferably stored 
on a storage medium or device (eg., CD-ROM, hard disk 
or magnetic diskette) that is readable by a general or 
special purpose programmable computer for configur- 
ing and operating the computer when the storage me- 
dium or device is read by the computer to perform the 
procedures described in this document. The system 
may also be considered to be implemented as a com- 
puter-readable storage medium, configured causes a 
computer to operate in a specific and predefined man- 
ner. 

Accordingly, an aspect of the present invention pro- 
vides computer software, residing on a computer read- 
able medium, for a data repository management system 
comprising a file system-based application and a data- 
base, the computer program comprising instruction to 
cause a computer system to perform the following op- 
erations: 

receive a file system request issued by the file sys- 
tem-based application; 

convert the file system request into a database op- 
eration; 

retrieve information from the database by perform- 
ing the data base operation on the database; 
transform the retrieved information into one or more 
file system objects according to predetermined cri- 
teria; and 

present the one or more file system objects to the 
file system-base application. 

A still further aspect provides a computer program, 
residing on a computer readable medium comprising in- 
struction for implementing any method or system as de- 
scribed herein. 



Claims 

1 . A method, performed on a computer system, of ac- 
cessing information in a database, the method com- 
prising steps of: 

retrieving information from the database; and 
presenting the retrieved information to an ap- 
plication as a file system object. 

2. A method as claimed in claim 1 wherein the step of 
retrieving comprises the steps of: 



making a database object available to an appli- 
cation as a file system object; 
receiving from the application a file system re- 
quest corresponding to the available file system 
s object; and 

transforming the file system request from the 
application into a database operation. 

3. The method of claim 2 further comprising perform- 
10 ing the database operation on the database using 

a database engines. 

4. The method of either of claims 2 or 3, further com- 
prising retrieving information from the database in 

15 response to the database operation. 

5. The method of any preceding claim further compris- 
ing returning the information associated with the da- 
tabase object to the application as a file system ob- 

20 ject. 

6. The method of claim 5 in which the returning com- 
prises arranging the retrieved information in a for- 
mat defined in an extension module. 

25 

7. The method of claim 6 further comprising displaying 
the formatted information on a screen of the com- 
puter system. 

30 8. The method of any preceding claim further compris- 
ing the displaying information associated with the 
database object on a display screen of the compu- 
ter system. 

35 9. The method of claim 8 in which the information as- 
sociated with the database object is displayed as a 
graphical presentation of a file system object. 

10. The method of any preceding claim in which the ap- 
40 plication that issued the file system request com- 
prises a database unaware application. 

11. The method of any preceding claim in which the da- 
tabase comprises a relational database; or 

45 

in which the database comprises an object-re- 
lational database; or 

in which the database comprises an object-ori- 
50 ented database. 

12. The method of any preceding claim in which the 
transforming comprises converting the file system 
request into a database query. 

55 

13. The method of claim 1 2 in wh ich the database query 
comprises an SQL-compliant query. 
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14. The method of either of claims 12 or 13 in which the 
transforming comprises converting the file system 
request into a database query based on information 
contained within an extension module. 

15. The method of any preceding claim further compris- 
ing determining whether the received file system re- 
quest corresponds to information managed by a file 
system; and, preferably, further comprising convey- 
ing the file system request to the file system if the 
file system request is determined to correspond to 
information managed by the file system. 

16. The method of claim 15 in which the determining 
comprises identifying a namespace to which the file 
system request is directed. 

17. The method of either of claim 15 or 16 in which the 
determining is performed by a method executing in 
a kernel address space of an operating system as- 
sociated with the computer system; or 

in which the determining is performed by a 
method executing externally to an operating 
system associated with the computer system. 

18. The method of any preceding claim in which making 
the database object available as a file system object 
is performed transparently to the application. 

19. The method of any preceding claim in which the da- 
tabase object comprises a table. 

20. The method of any preceding claim in which the file 
system object comprises a directory a file or a link. 

21 . The method of any preceding claim in which making 
the database object available as a file object com- 
prises presenting the database object as a plurality 
of file system objects in formats which are under- 
standable by different applications. 

22. The method of any preceding claim further compris- 
ing identifying the information to be retrieved from 
the database by specifying a database object from 
among database objects in the database. 

23. The method of claim 22 in which the identifying 
comprises recording in an extension module infor- 
mation descriptive of a predetermined set of data- 
base objects. 

24. The method of claim 23 in which the receiving com- 
prises selectively extracting information from the 
database based on information contained in the ex- 
tension module. 

25. The method of claim 24 in which the presenting 



comprises formatting the selectively extracted infor- 
mation into file system objects based on information 
contained in the extension module. 

5 26. The method of an preceding claim in which a col- 
lection of file system objects represents a hierarchi- 
cal file system having directories, files, links or a 
combination thereof. 

10 27. The method of any preceding claim further compris- 
ing providing a file system interface to the database 
which provides the application, preferably a file sys- 
tem based application, with access to information 
in the database. 

15 

28. The method of claim 27 further comprising provid- 
ing a module for differentiating file system requests 
directed to the file system from file system requests 
directed to the file system interface. 

20 

29. The method of either of claims 27 or 28 in which the 
file system interface further comprises an extension 
module that includes a file object. 

25 30. The method as claimed in any of claims 27 to 29 in 
which the file object comprises information for con- 
verting database objects into file system objects. 

31. The method of any of claims 27 to 30 in which the 
30 file system interface comprises a plurality of exten- 
sion modules each of which includes information for 
converting a different set of database objects into 
file system objects. 

35 32. The method of any of claims 27 to 31 in which the 
file system interface comprises a plurality of exten- 
sion modules each of which includes information for 
converting a single database object into multiple file 
system objects. 

40 

33. A computer based method of accessing information 
in a database, the method comprising: 

encoding a file handle with information that 
45 specifies a database object in a database; 

transmitting the encoded file handle in re- 
sponse to a file system request issued by an 
application; and 

50 

decoding the received file handle to identify the 
database object associated with the file system 
request. 

55 34. The method of claim 33 in which the encoding is 
based on the NFS protocol; or 

in which the encoding comprises including in- 
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formation in the file handle that identifies an ex- 
tension module that corresponds to the issued 
file system request; or 

in which the encoding comprises including in- s 
formation in the file handle that identifies as a 
database table and row that corresponds to the 
issued file system request; or 

in which the encoding comprises including in- 10 
formation in the file handle that identifies meta- 
data that corresponds to the issued file system 
request; or 

in which the encoding comprises including in- 15 
formation in the file handle that points to the da- 
tabase object that corresponds to the issued 
file system request. 

35. A system for accessing information in a database, 20 
the system comprising 

means for implementing a method as claimed 
in any preceding claim. 

25 

36. A system for accessing information in a database, 
the system comprising: 

means for making a database object available 

to an application as a file system object; 30 

means for receiving from the application a file 
system request corresponding to the available 
file system object; and 

35 

means for transforming the file system request 
from the application into a database operation. 
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